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Abstract 

Although it has been notoriously difficult to pin down precisely what it 
is that makes life so distinctive and remarkable, there is general agreement 
that its informational aspect is one key property, perhaps the key prop- 
erty. The unique informational narrative of living systems suggests that 
life may be characterized by context-dependent causal influences, and in 
particular, that top-down (or downward) causation - where higher-levels 
influence and constrain the dynamics of lower-levels in organizational hi- 
erarchies - may be a major contributor to the hierarchal structure of 
living systems [11 [H [31 II]- Here we propose that the origin of life may 
correspond to a physical transition associated with a fundamental shift 
in causal structure. The origin of life may therefore be characterized by 
a transition from bottom-up to top-down causation, mediated by a re- 
versal in the dominant direction of the flow of information from lower to 
higher levels of organization (bottom- up), to that from higher to lower 
levels of organization (top-down). Such a transition may be akin to a 
thermodynamic phase transition, with the crucial distinction that deter- 
mining which phase (nonlife or life) a given system is in requires dynamical 
information and therefore can only be inferred by identifying causal rela- 
tionships. We discuss one potential measure of such a transition, which 
is amenable to laboratory study, and how the proposed mechanism cor- 
responds to the onset of the unique mode of (algorithmic) information 
processing characteristic of living systems. 

1 Introduction 

A landmark event in the history of science was the publication in 1859 by 
Charles Darwin of his book On the Origin of Spectes [5 , affording for the 
first time in history, a scientific framework unifying all life on Earth under 
a common descriptive paradigm. However, while Darwin's theory gives a 
convincing explanation of how life has evolved incrementally over billions 
of years from simple microbes to the richness of the biosphere we observe 
today, Darwin pointedly left out an account of how life got started in the 



first place, "One might as well speculate about the origin of matter", he 
quipped. A century and a half later - although the origin of matter has 
been fairly well worked out - scientists still remain largely in the dark 
about life's origins. It would not be an exaggeration to say that the origin 
of life is one of the greatest unanswered questions in science. 

The origin of life in fact constitutes three related but distinct ques- 
tions: when did it happen; where it happen; and how did it happen? 
Progress toward understanding the first two of these questions has been 
markedly more successful than addressing the third. The "when" ques- 
tion is perhaps the best understood. There is good evidence that life on 
Earth was well established as early as 3.5 billion years ago [6], and pos- 
sibly as early as 3.8 billion years ago [7]. Finding still earlier traces of 
biological activity presents a considerable challenge, but astrobiologists 
generally agree that conditions - at least on Earth's surface - were hostile 
to life prior to about 4 billion years ago ^ (however the Earth's interior 
may have been habitable even earlier [9]), leaving an estimated window 
for life's emergence on Earth of just a few hundred million years. As far as 
where life began, a number of plausible settings have been suggested, in- 
cluding hydrothermal vent systems [lOllllj . drying lagoons or tidal shores 
[121 113j . deserts [14], mineral surfaces [151 1161 fTT] . the eutectic phase of 
ice [181 119| . and deep underground [5D|. Since any traces of life's origin 
have been obscured by the several billion years of ensuing evolutionary 
history (both of the geosphere and the biosphere), it may prove impossi- 
ble to identify a precise setting [5T]. In fact, it is not even clear that life 
began on Earth. A plausible scenario is that it started on Mars (which 
may have been somewhat more favorable than Earth for life prior to 3.8 
billion years ago [H]), and came to Earth inside impact ejecta [53]. Other 
extraterrestrial biogenesis scenarios have also been suggested. 

Here we sidestep the when and where issues, and address the hardest 
of the three origins questions: how did life begin? Of the many open 
questions surrounding how life emerges from non-life, perhaps the most 
challenging is the vast gulf between very complex chemistry and the most 
simple biology: even the simplest known biological organisms seem im- 
measurably more complex than any chemical reaction network we might 
engineer in the laboratory with current technology. The chemist George 
Whitesides, for example, has stated, "How remarkable is life? The answer 
is: very. Those of us who deal in networks of chemical reactions know of 
nothing like it." The heart of the issue is that we do not know whether the 
living state is "just" very complex chemistry, or if there something fun- 
damentally distinct about living matter. Right at the outset we therefore 
face a deep conceptual problem, asked long ago by the physicist Erwin 
Schrodinger [2^, namely. What is Life? Without a definition for life, the 
problem of how life began is not well posed. 

Given the immense challenge presented by addressing the problem 
head-on, the issue of defining life is typically averted by applying the Dar- 
winian criterion and assuming that if one can build a simple chemical 
system capable of Darwinian evolution the rest will follow suit and the 
question of the origin of life will be solved [251 1261 127| . Thus, although 
few are likely to be willing to accept a molecular self-replicator as living, 
the assumption is that Darwinian evolution will invariably lead to some- 
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thing everyone would agree is "alive" [28]. Darwinian evolution applies 
to everything from simple software programs, molecular replicators, and 
memes, to systems as complex as multicellular life and even potentially 
the human brain [29] - therefore spanning a gamut of phenomena ranging 
from artificial systems, to simple chemistry, to highly complex biology. 
The power of the Darwinian paradigm is precisely its capacity to unify 
such diverse phenomena, particularly across the tree of life - all that is re- 
quired are the well-defined processes of replication, heritable variation and 
selection. However, this is also the greatest weakness of the Darwinian 
paradigm as applied to the origin of life - it is too general and therefore 
provides no means for distinguishing "complex" from "simple", let alone 
life from non-life. Hence Darwin's own trepidation in speculating on life's 
origin. 

Therefore, to make progress in understanding the origin of life, a key, 
and frequently understated, distinction must be made between the evolu- 
tionary processes which potentially enable life to emerge and the mecha- 
nism(s) underlying the physical transition itself. Although it is notoriously 
hard to identify precisely what it is that makes life so distinctive and re- 
markable [301 1311 132| , there is general agreement that its informational 
aspect is one key property, perhaps the key property [33l[34l|35l|36j . If 
life is more than just complex chemistry, its unique informational aspects 
may be the crucial indicator of this distinction. The manner in which 
information flows through and between cells and sub-cellular structures 
is quite unlike anything else observed in nature. The catch is that the 
unique way information is processed in biology is not easily characterized 
|34U37j . While standard information-theoretic measures, such as Shannon 
informatiorQ have proved useful and relevant in many areas of biology, 
particularly in the burgeoning field of bioinformatics, much of the usage of 
informational language in biology seems to make use of a richer and more 
problematic concept than that tackled by Shannon [3H]. The archetypal 
example is provided by DNA. A misleading, but commonly held view- 
point is that DNA provides the blueprint for an organism. A blueprint 
has a one-to-one correspondence between the symbolic representation and 
the actual object it describes. In reality, DNA is not a blueprint per se 
but rather provides a set of algorithmic instructions for building a cell, 
which is fundamentally different from a blueprint. The algorithm stored 
in DNA may be executed only within the contextual framework provided 
by the cellular environment. This "contextual" - or "functional" - aspect 
of biological information sets it apart from a collection of arbitrary bits 
as characterized by its Shannon information. In this respect, biological 
information shares some common ground with the philosophical defini- 
tion of semantic information (which is more commonly - and rigorously 
- applied in the arena of "high-level" phenomena such as language, per- 
ception and cognition) |37j . Obviously one cannot quantify the semantic 
content of a DNA sequence in terms of local variables alone, because con- 
textuality is a global property, and this makes it difficult to develop a 

^The Shannon Entropy (information) of a random variable X is given by H(X) = 
— X]a;gA' P(^) 1°SP(2;) where X is the set of all possible states X and quantifies the number of 
bits of information in a given string. 
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mathematical model of contextual information. Without such a model, 
however, the problem of how contextual information emerged in the first 
place from "mere bits" is formidable. Nevertheless, we consider that it is 
this transition that is the key defining characteristic of what is implicitly 
referred to when one speaks of the "emergence of life" , amounting to the 
emergence of context-dependent causal influences. In this paper, we iden- 
tify the transition from non-life to life with a fundamental shift in causal 
structure, where a high-level entity (i.e. contextual information) gains 
direct, and context-dependent, causal efficacy over matter. 

The paper is organized as follows. We first introduce the key chal- 
lenges in identifying the unique mode of information processing in living 
systems and review historical approaches to information and the origin of 
life. Based on this discussion we then reframe the origin of life as a tran- 
sition in how information is dynamically processed leading to the onset of 
context dependent causal influences, and introduce the conceptual frame- 
work of self-referential dynamics - a key facet of context-dependent causa- 
tion. Using the implementation of self-reference in physical and artiflcial 
systems, we outline how the von Neumann paradigm for self-replicating 
automata elucidates key features of information flow in biological organi- 
zation. We also discuss implications for evaluating the status of a RNA 
world as living or nonliving. Next we introduce the conceptual framework 
of top-down causation and illustrate how a transition from bottom-up to 
top-down information flow is correlated with the emergence of collective 
behavior in a toy-model system describing replicative systems. We then 
outline the central claim of this paper discussing how a transition in causal 
structure may have driven the emergence of life. We conclude with some 
suggestions about possible methods via which our hypothesis might be 
tested as well as applications to experimental systems. 

2 Life and Information 

Over the past several decades the concept of information has gained a 
prominent role in many areas of biology. We routinely use terminol- 
ogy such as "signaling", "quorum sensing" and "reading" and "writing" 
genetic information, while genes are described as being "transcribed", 
"translated" , and "edited" , all implying that the informational narrative 
is aptly applied in the biological context. Despite this widespread use 
of informational terminology, the precise nature of biological information 
remains infamously diflicult to define (see e.g. for a detailed discus- 
sion). The challenge primarily stems from the semantic or contextually- 
dependent nature of biological information, leading to a unique mode of 
information processing in biological systems that differs from that of non- 
living dynamical systems [i.e. anything that is not biological or derivative 
of a biological process) such as a computer. 

All dynamical systems process information in a trivial sense. The 
physical state of a system at time ti is mapped into the state at a later 
time t2 in accordance with the laws of physics and the imposed boundary 
conditions. This mapping can be viewed as an algorithm describing the 
time evolution of the system. Thus, for example, Newtonian mechanics 
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provides the algorithm that maps the state of the solar system today onto 
its state tomorrow by specifying a trajectory through phase space. One 
may describe this input-output mapping using Shannon's definition of in- 
formation combined with some form of course graining. However, this 
is not the canonical description of such dynamics since the informational 
narrative, introduced only within the last century, typically does not hold 
much utility above and beyond more traditional descriptions - the algo- 
rithm is fixed such that the information content of a given state does not 
actively influence the laws of dynamical evolution. This state of affairs is 
characteristic when information plays a passive role. 

The situation is fundamentally different in biology. Biological infor- 
mation possesses an additional active quality that may roughly be called 
functionality j34l 1361 [37] . Consider as an example the nature of DNA as 
discussed in the introduction. On the one hand it is a physical structure, 
but on the other hand it is a digital information repository. A given DNA 
strand may contain special sequences called genes, or it may contain what 
is commonly referred to (perhaps misleadingly) as "junk" DNA. A key 
property of a gene is that it is a set of instructions - an algorithm - for 
the translational machinery to make a protein, whereas the "junk" - bet- 
ter described as sequences of "non-coding" DNA - is not. To say that 
information is "instructional" (or algorithmic) and "coded" represents a 
crucial conceptual leap - separating the biological from the non-biological 
realm - implying that a gene is "for" something. A gene has a function, 
and the processing of genetic information specifically implements that 
functionality, thereby bestowing upon life its distinctive "lifelike" quality 
(as opposed to an inert or arbitrarily changing complex system). But 
coded instructions are useless unless there is a system that can decode, 
interpret and act on those instructions. A gene that is silenced [e.g. by 
methylation) has the same information content, but it is stranded, unable 
to do anything biologically functional. An arbitrary sequence (even if un- 
methylated) inserted into DNA would very likely not possess functionality 
either, but at the molecular level a functional gene and an arbitrary base- 
pair sequence are physically indistinguishable. Without prior knowledge 
one could not tell, based on purely local structure and sequence infor- 
mation, whether a given sequence is part of a gene (i.e. functional) or 
not. 

Biologically functional information is not an additional quality, like 
electric charge, painted onto matter and passed on like a token. A gene is 
meaningful, or biologically functional, only within the context of the cell, 
where a suite of molecular hardware collaborate in decoding and execut- 
ing the encoded instruction [e.g. to make a protein). As such, biological 
information is an abstract global systemic entity, carrying meaning only 
within the context of an entire living system. It is of course imprinted 
in biochemical structures, but one cannot point to any specific structure 
in isolation and say "Aha! Biological information is here!": even the in- 
formation in genes is only efficacious and manifested in a relational sense 
(i.e. it must be decoded by the appropriate cellular machinery). The 
implication of identifying information as algorithmic or "coded" repre- 
sents a major conceptual leap, separating the biological realm from the 
non-biological. One might say that information is delocalized in biological 
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systems. This leads to a very different story tlian tlie trivial information 
processing encountered in traditional approaches to dynamics as outlined 
above. In biology information appears to have causal efficacy |39l 140] - 
it is the information that determines the state and hence the dynamics. 
The causal efficacy of information is made possible due to the separation 
of information and its physical representation. It is the efficacy of infor- 
mation - i.e. being "for" something - leads to a convolution of dynamical 
laws and states, where the state determines the dynamics and the dy- 
namics determine the state, thus standing in stark contrast to traditional 
reductionist descriptors of physical phenomena developed over the last 
300 years where the dynamical laws have always remained fixed [41] . 

This state of affairs is perhaps most evident for the case of the genome- 
proteome systems, in which the expressed set of algorithmic instructions 

- i.e. the relative level of gene expression - depends on the state of the 
system - e.g. the composition of the proteome, environmental factors, etc. 

- that in turn regulate the switching on and off of individual genes. There- 
fore, life is characterized by a situation where the dynamics are coupled to 
the information content of biological states such that the dynamics of the 
system change with the states and vice versa |41l 142] . Biological (func- 
tional) information is therefore inherently nonlocal with the result is that 
the update rules change with time in a manner which is both a function 
of the current state and the history of the organism (perhaps suggesting 
that even the concept of evolution itself may be in need of revision see e.g. 
Goldenfeld and Woese [421 143] for an engaging discussion). Here casting 
the problem in the context of an informational narrative is crucial - the 
algorithm describing the evolution of a biological system changes with the 
information encoded in the current state and vice versa. This feature of 
"dynamical laws changing with states" |41| . as far as we know, seems to 
be unique to biological organization and is a direct result of the pecu- 
liar nature of biological information and therefore serves as a contender 
for defining life (although speculative examples from cosmology have also 
been discussed, see e.g. [44]'). 

3 Information and the Origin of Life: Tra- 
ditional Approaches 

The widespread use of the informational narrative in biology suggests that 
understanding the origin of biological information is essential to our un- 
derstanding of the origin of life 33 , 34 , 35 . Much research on the origin 
of life has therefore focused on how information was generated and inte- 
grated into the first living systems [23|25l|3l|45llMliailllll3l5Ql[5l]. 
In particular, a longstanding debate in studies of the origin of life - of- 
ten dubbed the chicken or the egg problem - is which came first, genetic 
heredity or metabolism [211 152j . A conundrum arises because neither can 
operate without the other in contemporary life, where the duality is man- 
ifested via the genome/proteome systems. The origin of life community 
has therefore tended to split into two camps, loosely labeled as "genetics- 
first" and "metabolism-first." In informational language, genetics and 
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metabolism may be unified under a common conceptual framework by 
regarding metabolism as a form of analog information processing (to be 
explained below), to be contrasted to the digital information of genetics. 
In modern life, the proteome functions mostly as an analog computer, 
where the tasks to be executed are defined by protein expression (i.e. the 
protein composition of the cell at a given time|^ However, the information 
is stable over geological timescales because DNA provides a stable digital 
repository, subject to error correction, allowing information to be prop- 
agated from generation to generation digitally. Yet, even in modern life 
DNA alone does not capture the full information content of a cell: the full 
package of a cell complete with ribosomal translation machinery (digital 
to analog conversion) must also be passed on for inherited DNA to mean 
anything at all. The chicken-or-egg problem as traditionally posed thus 
amounts to a debate of whether analog or digital information processing 
came first. 

In approaching this debate, a common source of confusion stems from 
the fact that molecules play three fundamentally distinct roles: struc- 
tural, informational and chemical. To use computer language, in living 
systems chemistry corresponds to hardware and genetic information to 
software |23j . DNA molecules are undeniably a form of hardware, but the 
information they instantiate is software. As we outline below, almost all 
discussion about genetics-first versus metabolism-first scenarios for the 
origin of life concentrates on the chemistry, i.e. the hardware aspect. 
Such efforts have therefore primarily focused on passive (trivial) informa- 
tion processing, thereby leaving out what is arguably the most distinctive 
feature of biological information - its unique mode of active (nontrivial 
or algorithmic) information processing. As we have stressed, life does not 
simply accumulate and store information: it actively processes it. Bio- 
logical information is not always passive: it includes instructions capable 
of implementing physical and chemical changes in the cell and beyond 
{e.g. by directing which structures are built and when through a series 
of nested feedback loops). It is this "active" or instructional quality of 
biological information that sets living systems apart from their nonliving 
counterparts. 

3.1 A Digital Origin for Life 

The genetics first paradigm, identifying a digital information repository as 
the most essential feature of the first living systems, is favored by biolog- 
ical approaches to the origin of life, which extrapolate backward in time 
from the properties of modern organisms. As described above, in mod- 
ern cellular hardware, DNA is used for information storage, and proteins 
execute the myriad of functions necessary to maintain living organisms. 
The perplexing duality of genotype (i.e. DNA sequence) and phenotype 
(i.e. translated protein) has long been a source of puzzlement in studies 
of the origin of life. A commonly accepted resolution is that the modern 
"DNA-protein" world evolved from simpler precursor system involving 

■^An exception is provided by the "histone code", wliere the tails of histone proteins are 
tagged with methyl groups (or other species) amounting to a digital binary on/off switch |53| . 
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only one major molecular species playing a dual role of information car- 
rier and enzymatic catalyst. In modern organisms, RNA plays the part of 
biochemical mediator, enabling the translation of DNA to protein, and is 
unique in that it can fill both roles. This dual role of RNA, acting as both 
a genetic polymer and biochemical catalyst, has led to the popular "RNA 
world" hypothesis, which suggests that RNA was the major component 
of the very first living systems prior to the advent of DNA and protein 

[MESlESlllZlllH]. 

Despite the conceptual elegance of a putative RNA world, the hypothe- 
sis faces problems, primarily due to the immense challenge of synthesizing 
RNA nucleotides under plausible prebiotic conditions and the susceptibil- 
ity of RNA oligomers to degradation via hydrolysis [591 16UI 161] . Some of 
the chemical difficulties are alleviated if RNA was preceded by an alter- 
native genetic polymer such as peptide nucleic acid (PNA) [62] or threose 
nucleic acid (TNA) |63j (for other examples of candidate primitive genetic 
polymers see e.g. |64l 1651 |66] V In genetics-first origin of life scenarios, it 
has therefore been suggested that early-life may have undergone a "hard- 
ware upgrade" transitioning from a proto-RNA genetic polymer (or even 
an inorganic substrate) into a RNA-based biochemistry at a latter stage in 
its evolutionary history. This system would then have undergone further 
hardware upgrades or "genetic takeovers" to arrive at the DNA-protein 
world we observe today [481 167] . 

Beyond the chemical difficulties associated with RNA world scenar- 
ios, lies deeper conceptual challenge within the "digital-first" picture. As 
remarked above, the proteome - and in fact nearly any biochemical reac- 
tions in the cell - process information in an analog format. In example, 
information digitally stored in DNA must be first transcribed and then 
translated before it becomes algorithmically meaningful in the context of 
the cell where it is then processed as analog information through protein 
interaction networks. Focusing strictly on digital storage therefore ne- 
glects this critical aspect of how biological information is processed. As 
we discuss below, due to the structure of systems capable of process- 
ing algorithmic (instructional) information, it is not even clear that a 
monomolecular system - where a single polymer plays the role of catalyst 
and informational carrier - is logically consistent with the organization of 
information in living systems: there is no possibility of separating infor- 
mation storage from information processing. As such, digital-first systems 
are regulated to the realm of trivial information processing only and may 
therefore not be a contender for the first living systems (although they 
may be an immediate predecessor). 

3.2 An Analog Origin for Life 

In contrast to models suggesting a digital origin for life which rely on ex- 
trapolating backward from extant biology, approaches that move forward 
from simple prebiotic chemistries typically favor an analog format for the 
first living systems, where the information is contained in the composi- 
tion of an assembly of molecules rather than in a digital string of bits. 
Metabolism first scenarios for the origin of life fall within this analog 
framework, positing that early life was based on autocatalytic metabolic 
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cycles that would have been constructed in a manner akin to how ana- 
log computer systems are cabled together to execute a specific problem- 
solving task [681 I69| . The appeal of such metabolism-first scenarios is 
that the chemical building blocks - ranging from lipids [T^, to peptides 
[151 1711 172| , to iron-sulfide complexes [161 173j - are usually much easier 
to synthesize under abiotic conditions than any known candidate genetic 
polymer and would have therefore been much more abundant on the pre- 
biotic Earth. As such, metabolic models represent a prebiotic approach to 
biogenesis based on starting with realistic mixtures of abiotically derived 
chemical building blocks. The heritable information in this case typically 
consists of the compositional ratios of the molecules in the organized as- 
semblies. Although it has been suggested that such "composomes" might 
provide a primitive inheritance mechanism [74II75| . it is not clear that they 
are evolvable, since compositional information tends to degrade over suc- 
cessive generations and may lack the capacity for evolution via standard 
Darwinian means [76] (see [77| for a recent discussion of how such systems 
could be evolvable if possessing excess mutual catalysis). Therefore infor- 
mational inheritance is not nearly as clear cut here as it is in the digital 
picture (which incidentally may be more clear-cut precisely because the 
hereditary mechanism is modeled after that observed in modern life and 
is thus well-studied). 

Additionally, in the analog-first picture there exists a deeper issue of 
(re)programmability. Analog computers fell out of favor in the mid-20th 
century due to issues of universality - no analog device, regardless of its 
structure, is capable of being sufficiently general to solve any arbitrary 
category of problem (see e.g. [ZSl)- As we discuss below, all known life 
achieves universality (at least in a limited sense) through the implementa- 
tion of a universal constructor by utilizing the digital sequence structure of 
informational polymers. Such universality would be exceedingly difficult 
to engineer in an analog-only system given the challenges associated with 
building reaction networks where each (programmed) reaction is chem- 
ically orthogonal to all other reactions (i.e. non-interacting and thus 
non-interfering), whereas orthogonality is, by comparison, relatively easy 
to achieve with digital polymer sequences [79] . An added challenge is that 
information cannot be stored indefinitely on analog devices. Analog-only 
systems are therefore not capable of adaptation in the same way as living 
systems are, where a digital memory enables switching between different 
analog processors [i.e. biochemical network) and therefore adaptability to 
changing environmental conditions with the same basic toolkit (a feature 
facilitated by the capacity for nontrivial processing). This is another way 
of stating - in informational terms - that analog systems are not as ver- 
satile or as stable as digital systems and as such likely have very limited 
evolutionary capacity. 
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4 Redefining the Problem: An Algorith- 
mic Origin for Life 

By the above considerations, it seems that digital alone or analog alone 
is insufficient to provide a satisfactory account of the origin of life - not 
just on technical grounds, but for deep conceptual reasons. The former 
suffers from difficulties of prebiotic synthesis and due to fundamental lim- 
itations on how information can be processed in such scenarios (being 
trivial rather than nontrivial); whereas the latter suffers from issues of 
reprogrammability and long-term evolvability. This dilemma forms the 
crux of the chicken-or-egg problem cited above and suggests that focusing 
on the hardware aspect alone may be limiting progress. Hoyle for instance 
has equated the chance of life emerging from a prebiotic soup containing 
all the right hardware to the chance that "a tornado sweeping through a 
junk- yard might assemble a Boeing 747 from the materials therein" |80) . 
A key insight provided by Hoyle's analogy is that Boeing 747's are built 
based on well-defined instructional blueprints that guide the assembly of 
all that hardware, just as modern life is. Just looking at the hardware 
without this additional information won't get you very far. Including 
this informational aspect is therefore necessary to understanding how an 
arrangement as complex as a Boeing 747 or the modern cell arises. 

Therefore, in our view the primary challenge inhibiting progress is that 
both the digital-first and analog-first perspectives (as currently posed) 
typically fall within a traditionally reductionist view of the origin of life, 
based on the mantra "all life is chemistry" I42U811 : both approaches focus 
on the hardware aspects rather than the software (plus hardware). This 
traditional viewpoint assumes that if we can just determine the hardware 
from which life first arose - be it digital or analog - the problem of the 
origin of life will be solved. However, as noted in the introduction, due to 
the literally billions of years of evolution intervening between us and our 
ancestral chemistry, we may never know the exact chemical hardware from 
which life first arose (if it was in fact hardware, see e.g. [51] for a "software- 
first" hypothesis). Additionally, if life arose more than once - either on 
Earth or elsewhere - it is entirely possible it may have very different 
chemistry [821 1831 1841 185) . Focusing solely on the hardware aspect can 
therefore only take us so far in elucidating any mechanism(s) which may 
be universally characteristic of the transition from non-life to life and that 
may be independent of the specific hardware. Perhaps most importantly, 
such approaches do not explain how life emerges, thus leaving it up to 
a tornado (or Darwinian evolution) to perform the magic and assemble 
the requisite hardware. This framework is equivalent to identifying all of 
the pieces in a game of chess without outlining any of the rules for how 
to play the game, thus leaving it up random chance that a rule set will 
emerge which makes sense. The trick is of course that the rules and the 
pieces must co-evolve - you may not end up with the same game every 
time, but you will at least end up with one where the rules make sense 
with respect to the pieces on the board. This becomes critically important 
in addressing the origin of life because of the unique way in which living 
systems process information where the rules are dynamically coupled to 
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the states (and as such must co-evolve). 

In contrast to this picture, an implicit assumption of traditional ap- 
proaches to the origin of life has been that, while information may be 
manifested in particular chemical structures (digital or analog), it has no 
autonomy. As such, information - although acknowledged as likely play- 
ing a central role in life's emergence - has traditionally played a passive 
role in studies of the life's origins. Instead, hardware has always been 
at the forefront of the discussion. However, as stressed above, the truly 
unique feature of biological systems is that the software and hardware are 
dynamically coupled. Due to the distributed (i.e. contextual, nonlocal) 
nature of biological information, life is characterized by a situation where 
the update rules change with time in a manner which is both a function 
of the current state and its history. Both the traditional digital-first and 
analog-first viewpoints neglect this active (algorithmic or instructional) 
quality of biological information. Yet, arguably it is this feature - that 
the algorithm evolves in time {e.g. as a response to the state of the sys- 
tem) - that gives life its distinctive "lifelike" qualities. A full picture of 
the origin of life must therefore account for both the software and hard- 
ware aspects and explain how active (nontrivial) information processing 
emerges from systems which process information only passively (trivially). 

We consider that it is this transition that is the central defining prop- 
erty of what is implicitly referred to when one speaks of the emergence of 
life. Thus the origin of life is not a chicken-or-egg problem, which engages 
a debate about this or that specific hardware, but instead is something 
fundamentally different - and perhaps more universal - corresponding to 
a transition whereby the entire way of doing business is changed. Adopt- 
ing this view, the problem of life's origin boils down to determining how 
a collective contextual information processing system emerged in which 
the information became distinct from the matter it operates on. Such a 
transition enables the emergence of nontrivial information processing with 
context dependent causal influences. It is this viewpoint which we explore 
for the remainder of this paper. 

5 The Challenge of Self- Reference 

One of the earliest treatments of the algorithmic, or instructional, nature 
of biological information was provided by von Neumann in his formulation 
of self-replicating automata [86]. Von Neumann set out approaching the 
more general problem of whether it was possible to build a machine that 
could perform any physical task - including constructing itself. Seeing 
the parallels between biological systems - such as the human nervous 
system - and computers, and drawing inspiration from Turing's work on 
universal computation, he sought a formalism that would include both 
natural and artificial systems [87] . Turing had shown that it was possible 
to build a relatively simple computer which, given enough time, could 
output any computable function |H8|. By analogy with such a "universal 
Turing machine" , von Neumann devised an abstraction called a universal- 
constructor (UC), a machine capable of taking materials from its host 
environment to build any possible physical structure (consistent with the 
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available resources and the laws of physics) including itself. An important 
feature of UCs is that they operate on universality classes, where here we 
define a universality class as the set of all possible constructions that can 
be made from a given set of building blocks. In principle, a UC is capable 
of constructing any object within a given universality class (including 
itself if it is a member of the relevant class). An example of such a 
universality class relevant to biological systems is the set of all possible 
sequences composed of the natural set of twenty amino acids found in 
proteins. The relevant UC in this case is the translation machinery of 
modern life, including the ribosome and associated tRNAs along with an 
array of protein assistants]^ This system can, in principle, construct any 
possible peptide sequence composed of the coded amino acids (with minor 
variations across the tree of life as to what constitutes a coded amino acid 

m)- 

The UC forms the foundation of von Neumann's theory on self-replicating 
automata. However, a UC is a mindless robot, and must be told very 
specifically exactly what to do in order build the correct object(s). A UC 
must therefore be programmed to construct specific things, and as such it 
must be provided with a blueprint of itself in order to replicate]^ How- 
ever, as von Neumann recognized, implicit in this seemingly innocuous 
statement is a deep conceptual difficulty concerning the well-known para- 
doxes of self-reference [901 191) . To avoid an infinite regress, in which the 
blueprint of a self-replicating UC contains the blueprint which contains 
the blueprint ... ad infinitum, von Neumann proposed that the blueprint 
must play a dual role: it should contain instructions - an algorithm - to 
make a certain kind of machine (e.g. the UC) but should also be blindly 
copied as a mere physical structure, mthout reference to the instructions 
its contains, and thus reference itself only indirectly. This dual role is 
familiar from that played by DNA in modern life, where genes act both 
passively, as hereditary structures to be copied, and actively, as a source of 
algorithmic instructions. To implement this dualistic role, von Neumann 
appended a "supervisory unit" to his automata whose task is to supervise 
which of these two roles the blueprint must play at a given time, thereby 
ensuring that the blueprint is treated both as an algorithm to be read-out 
and as a structure to be copied depending on the context. In this manner, 
the logical structure of a von Neumann automaton ensures that instruc- 
tions and their physical representation are distinct - thus satisfying the 
hallmark feature of nontrivial information processing as outlined above, 
while simultaneously providing a mechanism for instantiating indirect, 
self-referential dynamics. To be functional over successive generations, a 
complete self-replicating automaton must therefore consist of three com- 
ponents: a UC, an (instructional) blueprint, and a supervisory unit. 

All known life contains these three components, which is particularly 
remarkable, given that von Neumann formulated his ideas before the dis- 
coveries of modern molecular biology including the structure of DNA and 

■^In is important to note that the mapping between extant life and a von Neumann automata 
is not precise - it is not at all clear that DNA contains the algorithm for building an entire 
cell - other factors such as epigenetic and environmental ones, influence the final outcome. 

^Likewise a UC can construct any other object within its universality class if fed the 
appropriate instruction to do so. 
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the ribosome. From the insights provided by molecular biology over the 
past 50 years, we can now identify that, at least to a rough approximation, 
all known life functions as a von Neumann automatcj^ where DNA pro- 
vides the blueprint, ribosomes act as the core of the universal constructor 
and DNA polymerases along with a suite of other molecular machinery 
play the role of supervisory unil|^ [911 192| . However, the more important 
insight provided by von Neumann's work may not lie in the association of 
specific automata components with specific biochemical structures (which 
is not an exact mapping), but instead in how they inform our understand- 
ing of the logical structure of information processing and information flow 
in living systems. In particular, the foregoing theory for so-called nontriv- 
ial self-replicating automata, which appears to fit the logical organization 
of known biology very well, must be contrasted with trivial replication of 
the sort displayed by crystals. Being the regime where these two domains 
intersect, this distinction becomes increasingly important in discussions 
of life's origin, a discussion to which we now turn. 



5.1 Trivial versus Nontrivial Self-Replication 

Nontrivial replicators of the sort devised by von Neumann - including 
known life - are distinct from trivial replicators such as viruses, nonenzy- 
matic template replicators, lipid composomes, crystals, computer viruses, 
and Penrose blocks |93] . Cast in the language of the discussion in the pre- 
vious section, trivial replicators are systems capable of self-replication, but 
which process information strictly in the passive sense. Typically, triv- 
ial replicators are characterized by building blocks which are not much 
simpler than the assembled object. Schrodinger recognized this key dis- 
tinction in his take on What is Life? when he postulated that the genetic 
material of life must be an "aperiodic crystal" [24] . Algorithmic informa- 
tion theory can make this statement precise. The algorithmic information 
of a system or structure is defined to be the Shannon information con- 
tained in the shortest algorithm that can specify the system or structure 
as the output of the algorithm [941 1951 |96] . A periodic crystal, for ex- 
ample, has very low algorithmic information because it may be specified 
by a compact instruction set of the form, "Add another atom at distance 
X and repeat N times." Structures that contain patterns contain redun- 
dancy, and as such can be specified by algorithms with low- information 
content. On the other hand, random structures are algorithmically in- 
compressible: they cannot be specified as the output of algorithms much 
simpler than themselves. Gene sequences are random in this respect, i.e. 
genes typically contain no patterns or redundancy]^ Thus a trivial replica- 
tor like a crystal is one that may be specified by an algorithm containing 

ribosome is fed an instruction to make a protein, but does not directly produce itself. 
This feature however is crucial to the evolutionary capacity of living systems - if the UC 
directly constructed itself, any reprogramming might destroy the functionality of the UC and 
therefore restrict the automata from evolving by natural selection. 

^An important note is that the all-important dual role cited above is clearly implemented: 
DNA polymerases are oblivious to the instructions that DNA contains and will blindly copy 
both coding and noncoding sequences. 

^Occasionally genes get duplicated, but within a gene the sequence is random. 
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far fewer bits than the system it describes, whereas a non-trivial rephcator 
is algorithmically incompressible and requires an algorithm, or instruction 
set, of complexity comparable to the system it describes (or creates). 

An even greater logical divide between trivial and nontrivial replicators 
exists because the former is not explicitly programmed, but rather relies 
on the implicit physics (and chemistry) of the current environment to sup- 
port replication and therefore only a limited set of objects within a given 
universality class are constructible. In other words, trivial self-replicating 
systems can only access one instructional mode - the one which the sys- 
tem is currently operating in - and as such are capable of only passive 
information handling. This stands in stark contrast to the case for non- 
trivial replicators, where any possible object within a given universality 
class (as defined above) - including the UC - can be constructed if the UC 
is provided with an appropriate instruction. Nontrivial replicators may 
therefore be explicitly programed to construct specific objects, and do so 
in some sense independent of direction by the implicit physics and chem- 
istry (although of course adhering to the constraints imposed by physical 
law). As such, only nontrivial replicators process information in an active 
sense, enabling the possibility for the state to change in response to the 
dynamics and vice versa. Because of this fundamental distinction in how 
information is handled and processed, nontrivial and trivial replication are 
two logically and organizationally distinct possibilities for self-replicating 
physical systems, which are distinct in much the same manner as systems 
which actively process information are distinct from those that do so only 
passively. 

5.2 Algorithmic Takeover 

Although modern life is clearly representative of the class of nontrivial 
self-replicators, the majority of work on the origin of life has focused on 
the conceptually simpler case of trivial self-replication. This not with- 
out good reason: the origin of translation - mediating what is know of 
the transition from trivial to nontrivial - is notoriously difficult to pin 
down, amounting to an algorithmic takeover of information stored in one 
molecular species that becomes operable over another. To enact such a 
feat, life as we know it requires two complementary sets of very difi'erent 
molecules, nucleic acids and peptides - it is therefore "bimolecular" . The 
division of labor implicit in bimolecular life bestows one very obvious and 
distinctive advantage; it enables the instructions to be physically sepa- 
rated and stored from the hardware that instantiates them. The encoded 
transactions between the two sets of molecules takes place via messengers 
and bilingual agent^ that identify, and are read by a system that can 
decode the instructions. Thus the software, or algorithm, inhabits one 
molecular universe and its products inhabit another. This arrangement 
is familiar from human technology. The software controlling a robot arm, 
for example, need not be located in the arm, and the substrate contain- 
ing the software is not made out of the physical material that constitutes 

* "Bilingual" here means tRNA molecules that recognize both the four-letter alphabet of 
nucleic acids and the twenty-letter alphabet of amino acids. 
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the arm. Or to take another example, the instructions to operate a ra- 
dio controlled plane are located far from the plane and relayed via coded 
electromagnetic pulses. The software is located on a chip in the trans- 
mitter and not made out of plane components. Contrast this with a kite, 
which is much more cumbersome because its controlling elements (wires) 
are coupled directly to the kite and made of the same sort of stuff. In the 
case of the radio control plane, the operator and plane are decoupled and 
the operator may, for example, run in one direction and send the plane in 
another. This is not true for the kite, which is always physically connected 
to its operator and therefore subject to direct manipulation by his or her 
relative physical motion. 

Often, algorithmic takeover {e.g. epitomized in the origin of trans- 
lation) - mediated via the separation of information from its physical 
representation - is implicitly considered to be much later than the origin 
of life itself, even while it is simultaneously agreed that without a proper 
definition for life, identifying precisely when in the evolutionary process 
life emerges from non-life is not feasible. The justification for such an 
approach is rooted in the Darwinian criterion (replication plus heritable 
variation), which assumes that a simple Darwinian system is all that is 
necessary to get the process started - from there, as discussed in the in- 
troduction, the line or reasoning then follows that natural selection will 
provide a sufficient driver of complexification toward the living state. As 
such, the problem of the origin of life has effectively become synonymous 
with solving the conceptually simpler problem of the origin of Darwinian 
evolution [2511261 [27ll97| . thereby skirting the issue of identifying precisely 
what we mean by the "emergence" of life in the first place. More specifi- 
cally, this approach neglects the distinction between trivial and nontrivial 
replicators, treating them on equal footing given that both can, in princi- 
ple, evolve through Darwinian means. 

For example, classic RNA world models, including the "holy grail" of 
RNA world chemistry - the RNA self-replicator [57l[98 , 99 , 100l[ToTl[102] - 
and more diverse models including RNA quasispecies 46, 103] and hyper- 
cycles |47j . nearly universally represent examples of trivial self-replication, 
since these systems can, at least in principle, make more of themselves, or 
nearby mutants in sequence space, but cannot be programmed to make 
any possible construction in their particular universality class {e.g. any 
possible RNA sequence). Instead, such systems must rely on the environ- 
ment to be suitably prepared to support replication with just the right 
mixture of chemical building blocks. If the environment changes, the sys- 
tem will "evolve" such that a new fitness peak is achieved and a new 
distribution of the most efficiently replicating RNAs found. However, this 
type of evolution does not solve the problem of universal construction or, 
even more critically, provide a means for instantiating self-referential dy- 
namics - characterized by a convolution of dynamical laws and states - 
that are a hallmark of living systems (and perhaps the hallmark as out- 
lined here). Therefore, although a hypothetical RNA world may store and 
propagate information in an analogous fashion to modern life and satisfy 
the minimal requirements for Darwinian evolution (hence the appeal), it 
does not allow the possibility for (universal) information processing in the 
same way that all known life-forms do. The RNA world - or any other 
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strictly "digital-first" paradigm - is thus separated from known life by a 
logical and organizational chasm that cannot be crossed by mere complex- 
ification. In the case of the RNA world as currently understood, we see 
a clear example of systems considered to have Darwinian potential, but 
falling short of being truly living - such scenarios do not possess any of 
the information processing capabilities that are truly a hallmark of life: 
all RNA world scenarios so far proposed lack non-trivial self-referential in- 
structional dynamics. Instead these (and most other origin of life scenar- 
ios - digital or analog) fall in the class of systems that can only passively 
process information in a trivial sense. 

What, then, would it take for a RNA world, for example, to transition 
from trivial to nontrivial information processing? If we assume explic- 
itly encoded instructional information is essential - as it is in modern 
life - the solution requires a UC that acts on the universality class of all 
RNA sequences. The ribosome is not it. Although ribosomes are partly 
composed of RNA, the universality class they act on is not RNA, but 
proteins. Discovering an RNA world equivalent of a UC might provide 
a pathway to true RNA life in the von Neumann sense. We might call 
this "monomolecular life". Monomolecular life, complete with its own 
UC, might be a necessary stepping stone on the way to the bimolecular 
arrangement observed today, or it may be that a RNA-only UC was not 
required for a transition to the RNA-protein world. Discovering which 
scenario played out in the history of life on Earth would shed tremendous 
light on how the first instructional biological language may have emerged, 
thereby mediating the transition of algorithmic takeover. However, if 
primitive "life" was in fact a monomolecular, it would operate more like a 
kite than a radio controlled plane - there would be no way to physically 
decouple the instructions from the hardware. Thus, while a monomolec- 
ular system might satisfy a primitive definition of life, it would be very 
inefficient and cumbersome to evolve. Of course, we are familiar with 
the fact that digital information processing and control are superior to 
analog systems when it comes to modern technology. Somehow life "went 
digital" billions of years ago, presumably at the point where hardware 
and software became separated. Siuc(^ the logical structure of a UC is 
independent of the materials from which it is constructed, this raises the 
interesting question of whether alternative instantiations of UCs might 
be possible in extraterrestrial life, and whether or not all life requires a 
logically similar translational apparatus to enable nontrivial information 
processing. However, it is hard to see how this cjuestion can be answered 
without a better understanding of how universal construction arose on 
Earth. 

6 The Origin of Life: A Transition in 
Causal Structure 

The most important insight from the forgoing discussion is that algorith- 
mic takeover mediates a well-defined transition (even if not well-understood) 
whereby the informational narrative is freed up to be almost independent 



16 



of the material one. Thus one may sensibly discuss cell-sell signaling say, 
or sense data flowing along nerves to the brain, without specific reference 
to the underlying activity of electrons, protons, atoms and molecules. One 
might say that the emergence of contextual information was the step that 
effectively separated information from matter in biology. Of course all 
information requires a material substrate of some sort, but life cannot be 
understood in substrate terms alone, as the vitalists once thought. Thus 
it would be absurd to say that an atom in a strand of DNA is alive, yet the 
organism as a whole is alive; in short, "aliveness" is an emergent global 
property |8T1IT04| . 

Since von Neumann's formalism may be equally well applied to both 
natural and artificial automata, by the above considerations it would seem 
that a self-replicating computer program should be considered living if it 
possesses a UC. Ifowever there is a key difference in how algorithmic infor- 
mation is processed in the natural and artificial arena, a distinction that 
may provide clues into narrowing the definition of what the living state 
actually is. On a computer, self-replicating automata (i.e. self-replicating 
software programs) execute replication by creating a new copy of their 
software program and storing the newly generated data in an already es- 
tablished and immutable hardware. In biology, information is copied and 
algorithmic instructions executed as mutable hardware is manipulated, 
created, and destroyed through biochemical interactions. In essence, bio- 
logical systems are unique because the information manipulates the matter 
it is instantiated in. In biology, there is no firm distinction between hard- 
ware and software or between program and data - the program is the data 
and the data is the program [42]. This leads to a very different causal 
narrative then that observed in nonliving systems: life is characterized by 
a situation where almost all causal factors are context-dependent. The 
efficacy of information therefore permits multidirectional causality with 
causal influences running both up and down the hierarchy of structure 
of biological systems {e.g. both from state to dynamics and dynamics to 
state). 

Therefore, just as software engineering is a discipline that can proceed 
with design features and improvements with little reference back to the 
silicon world that supports it, so life may control its environment, interact 
with other life and evolve, via the modification and flow of information. 
A striking illustration of the foregoing property is the creation of my- 
coplasma laboratorium, in which the software of mycoplasma genitalium 
was re-programmed by designing new software, inserting it into a cell, 
and booting up the new program |105) . Because the operating system is 
the same for all known life, modifying the information content of the cell 
was relatively easy. Expressed differently, the two organisms - donor and 
recipient - belong to the same universality class. The versatility that this 
straightforward experimental procedure exemplifies is in marked contrast 
to the prospect of designing a new cell from scratch, an almost impossible 
task far beyond current technology. What matters here is not the physical 
stuff that gets from one organism to another, but the functional informa- 
tion content; it is the information transfer that counts, not the specific 
material substrate that encodes it. It then becomes possible to describe 
biological activity in purely informational terms, including constructing 
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causal narratives about information fiow, without needing to refer back to 
the material substrate itself, or its own causal narrative. The information 
literally takes on "a life of its own" . 

The liberation of the informational narrative from the material one 
therefore elicits a well-defined transition which may be the key step in the 
origin of life. To achieve this, the von Neumann formalism requires algo- 
rithmic information to be explicitly encoded. However, it is an open ques- 
tion whether such explicitly encoded algorithmic information is a strict 
requirement for this kind of informational and causal narrative. A more 
relaxed condition for the emergence of life may be decoupling of informa- 
tion and matter due to delocalization of information corresponding to a 
delocalization of causal influences - features which may be more generi- 
cally characteristic of self-referential dynamics. Such a transition requires 
the abstraction of information from its physical representation, and may 
therefore be a necessary precondition to algorithmic takeover which re- 
quires a strict separation of software (information) from hardware (mat- 
ter) mediated by an explicit encoding scheme. The most critical question 
in addressing the origin of life may therefore be: when does a chemical 
system operating more like a kite, start to behave more like the radio 
controlled airplane? Such a transition should be marked by a shift in 
causal structure, where information (a "high-level" systemic entity) gains 
causal efficacy over the matter it is instantiated in. The origin of life may 
therefore be characterized by a well-defined transition mediated by a re- 
versal in the dominant direction of information flow from lower to higher 
levels of organization (bottom-up), to that from higher to lower levels of 
organization (top-down) which corresponds to a fundamental transition 
in causal structure. 

6.1 Top-Down Causation and Informational Effi- 
cacy 

Given the hierarchy of structure and causation of living systems, a full ex- 
planatory framework for biological processes should include both bottom- 
up causation (Figure [l^) - such as when a gene is read-out to make a 
protein that affects cellular behavior - and top-down causation (Figure 
[Tja) - as occurs when changes in the environment initiate an organismal 
response that permeates all the way down to the level of individual genes 
|41| . Bottom-up causation has been the status-quo throughout the history 
of science, whereas top-down causation is less familiar and more difficult 
to quantify. Top-down (or downward) causation may be unique to the bio- 
logical realm in the sense we are using it, and has been suggested to occur 
at all scales of biological organization as a mechanism for explaining in- 
formational and causal hierarchies in living systems [TJ [2l [S] |4l 1391 1401 141| . 
Generally, top-down causation is characterized by a 'higher' level influ- 
encing a 'lower' level by setting a context (for example, by changing some 
physical constraints) by which the lower level actions take place [3l l39ll40j . 
In other words, top-down causation is characteristic of systems where al- 
most all causal factors are context-dependent and therefore depend on 
information stored in the current state of the system {e.g. in a set of 
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Figure 1: Bottom- up and top-down modes of causation, (a) The standard 
(reductionist) view suggests everything in the universe is directed by bottom- 
up action only, such that causation flows strictly from lower to higher levels, 
(b) Biological organization suggests an alternative causal structure whereby 
bottom-up modes of causation emanating from lower-levels provide a space of 
possibilities, and higher-levels of organization modify the causal relations below 
via top-down causation. Figure adapted from Auletta, Ellis and Jaeger [39] . 



context-dependent constraints which dictate the dynamics). 

A classic example of top-down causation is provided by natural selec- 
tion in evolution [Tl [2l [T06] . where higher level processes {e.g. environ- 
mental factors and organismal phenotype) constrain and influence what 
happens at lower levels {e.g. biochemistry). This is particularly evident 
for cases of convergent evolution |107j , of which there are many examples 
including wings, eyes, and the nonhomologous families of sugar kinases 
|108| . A very recent example of convergent evolution is the discovery 
of the independent origins of striated muscles in cnidarians and bilateri- 
ans [109j . In each case of evolutionary convergence, the commonality of 
form derived from independent origins is attributable to physical (envi- 
ronmental) constraints, which manifests a particular phenotypic trait in 
the organism (i.e. a wing, an eye, a muscle-type or a particular enzymatic 
functionality). However, the effect is also a local physical one. For exam- 
ple, the biochemical interactions - dictated in part by both genetic and 
epigenetic programming - that govern the morphological development of 
something as complex as a wing, an eye, or a muscle, are inherently local. 
Likewise the function of kinase enzymes is dependent on the larger context 
of the network of protein interactions within which they are embedded, 
but their proper folding is dependent on purely local effects. As such, nat- 
ural selection provides a well-known example of how higher level processes 
{e.g. environmental selection) constrain and influence what happens at 
lower levels {e.g. biochemistry)]^ 

^Although it is normal for biologists to discuss causal narratives in informational terms {e.g. 
cells "signal" each other, and "recruit molecules" to "express instructions" . . . ) a determined 
reductionist would argue that, in principle, this narrative would parallel an, albeit vastly more 
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The top-down causal efficacy of information relies on the notion of 
equivalence classes, where a given higher-level state leads to the same 
high-level outcome, independent of which lower-level state instantiates it 
|40| . In biology a function defines an equivalence class: operations are 
considered (functionally) equivalent (i.e. in the same equivalence class) if 
they produce the same outcome for some associated functional purpose. 
This is evident in the examples of convergent evolution cited above, in each 
case convergence occurs because the selected structures fit a functionally- 
equivalent role. Auletta et al. present a more rigorously studied example 
of this particular concept with RNAse P enzymes |39| . Type A RNAse 
P is found in E. coli whereas type B is found in B. subtilis. The two 
enzymes have very different three-dimensional structures, with different 
properties in vitro; however m vivo these two very different enzymes per- 
form the same functional role and can in fact replace one another without 
loss of functionality |110| - that is to say that the two molecules, al- 
though structurally different, are functionally equivalent. The notion of 
functional equivalence is closely related to that of biological modularity, 
where a module is a separable functional unit within a biological network 
[11111112] . Modules may be replaced by other members of their functional 
equivalence class without affecting the overall network structure {e.g. one 
may replace a node within a network and preserve the overall network 
structure so long as all the edges - i.e. the functional interactions - are 
left intact). The functional (semantic information) is therefore decou- 
pled from the local physics and chemistry - any local structure will do 
so long as the overall network structure - characterized by information 
stored in the state of the system - is maintained. Functional equivalence 
among lower level states therefore enables higher-level processes to have 
autonomy from, and therefore top-down causal efficacy over, lower-levels, 
permitting the unique feature that the informational narrative is freed up 
from the material one. 



6.2 Biological Hierarchies and Causality 

The foregoing discussion indicates that top-down causation - mediated 
by informational efficacy - plays an important role in dictating the dy- 
namics of living systems, where causal influences run both up and down 
modular ly organized structural hierarchies. At the very least it provides 
a productive phenomenological framework for addressing the outstanding 
problem of how new-levels of biological organization emerge. Walker, Cis- 
neros and Davies (WCD) have recently proposed that many of the major 
evolutionary transitions - ranging from the origin of life, to the origins 
of multicellularity and eusociality - might be characterized by transitions 
in causal structure, mediated by information gaining causal efficacy over 
increasingly higher levels of organization [113j . In many of the major evo- 
lutionary transitions, new higher-level entities emerge from the collective 
and coordinated behavior of lower-level entities, eventually transitioning 

complicated, account in terms of molecular interactions alone, in which only material objects 
enjoy true causal efficacy. In this paper we remain agnostic on the question of such "promissory 
reduction" because our principal claims remain valid even if the informational-causal narrative 
is accepted as a mere fafon de parler. 
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to a state of organization where the lower-level entities no longer have 
reproductive autonomy |45l 1501 11141 I115j . During such transitions, the 
dynamics of lower level entities come under the direction of an emergent 
high-level entity. This, coupled with the multidirectional causal influences 
in biological organization, suggests that evolutionary transitions that in- 
corporate new, higher levels of organization into a biological system should 
be characterized by a transition from bottom-up to top-down causation, 
mediated by a reversal in the dominant direction of information flow. 
Therefore, major shifts in biological complexity - from lower level entities 
to the emergence of new, higher level entities - may be associated with a 
physical transition (possibly akin to a thermodynamic phase transition), 
and this physical transition is in turn associated with a shift in causal 
structure. 

In the present paper we hypothesize that the forgoing reversal of infor- 
mation flow is associated with the most critical transition of all, namely, 
that from non-life to life. To demonstrate such a reversal as a generic fea- 
ture of many of the major evolutionary transitions, WCD presented a toy 
model investigating the emergence of non-trivial collective behavior in a 
globally coupled chaotic map lattice composed of logistic maps |116II117] . 
The logistic growth model was chosen for its connection to the replicative 
growth of biological populations |118| . thereby enabling an analogy with 
the transition from independent replicators to collective reproducers, cited 
as a hallmark feature of several of the major evolutionary transitions [45j . 
The model provides a clear example of how a reversal in information flow 
- from bottom-up to top-down - can describe a transition from a group 
of independent low-level entities to the emergence of a new higher-level 
(collective) entity. 

The model system of WCD is defined as 

Xi^n+i = (1 - e)fi{xi^n) + e m„ ; (i = 1, 2, . . . TV) (1) 

where the function fi{xi^n) specifies the local dynamics of element i, N is 
the total number of elements, n is the current time-step (generation), and e 
is the global coupling strength to the instantaneous dynamics of the mean- 
field, m„, defined below in eq. |4] In analogy with biological populations, 
the element index i may be associated with a specific phenotype within 
a given population, and e marks the strength of the global informational 
control over the local dynamics of each such element. The local dynamics 
of each element i is defined by the discrete logistic growth law 

fiixi,n) = r\Xt,„ (^1 - (2) 

where Vi is the reproductive fitness of population i, and K is the carrying 
capacity (set to K = 100 for all i in WCD). The instantaneous state of 
the entire system at time step n is specified as an average over all local 
states by the instantaneous mean-field M„, 

1 ^ 



21 



40 50 60 70 80 

M 

n 



Figure 2: Return map for varying values of the global coupling strength e. 
Shown are return maps for e = (magenta), e = 0.075 (red), e = 0.1 (blue), 
e = 0.2 (orange), e = 0.225 (aqua), e = 0.25 (dark green), e = 0.3 (stars), 
and e = 0.4 (black). Also shown is the return map for a single logistic map 
(bright green). The inset shows an expanded view. Return maps for e = 0.2 
(orange), e = 0.225 (aqua), e = 0.25 (dark green) demonstrate the spontaneous 
emergence of collective behavior coincident with a dominance of top-down causal 
information transfer (shown in Figure [s]). Figure adopted from Walker, Cisneros 
and Davies |113j . 



The instantaneous dynamics of the mean-field, defined as 



is a global systemic entity (i.e. it cannot be identified with any specific 
local attribute), which has direct impact on the dynamics of local elements 
i in the model system. The infiuence of this abstract global entity is 
dictated by the global coupling strength e. 

Following the dynamics of a set oi N = 1000 coupled logistic maps, 
WCD generated a time series of both the instantaneous states of the local 
elements, Xi^n, and of the mean-field, M„ from which causal directionality 
and the associated flow of information were determined. In general, the 
trends observed indicate that each time a collective state emerges, causal 




(4) 
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Figure 3: Top-down, Tm^x (solid), and bottom-up Tx^m (dashed), causal 
information transfer for varying global coupling strength e for a coupled chaotic 
logistic map lattice. Figure adopted from Walker, Cisneros and Davies |113| . 



information transfer is dominated by information flow from global to lo- 
cal scales (see Figures [2] and |3| . Particularly interesting is that top-down 
causation dominates for collective states in regimes with 0.2 < e < 0.7, 
where the contribution from the global dynamics is not necessarily the 
dominant contribution in eq. ([T| [i.e. for 0.2 < e < 0.5). In this regime, 
although the weight of the contribution from the global scale may be less 
than the contribution from the local scale in dictating the local dynamics, 
collective states self-organize which are driven by top-down causal infor- 
mation transfer from the mean-field. The results presented for this toy 
model system indicate that a transition from a population of independent 
replicators, to a collective representing a higher-level of organization, can 
be mediated by a physical transition from bottom-up to top-down infor- 
mation flow, where non-trivial collective behavior is associated with the 
degree to which local elements receive information from the global net- 
work. 

The results presented for the toy model system indicate that a tran- 
sition from a population of independent replicators, to a collective rep- 
resenting a higher-level of organization, could be mediated by a physical 
transition from bottom-up to top-down information flow. The dynamical 
system investigated was designed to parallel transitory dynamics believed 
to be a hallmark feature of many major evolutionary transitions - i.e. 
those characterized by the emergence of higher-level reproducers from 
previously autonomous lower- level units [45]. Here, new high-level enti- 
ties would be expected to emerge as e — >■ 1 (although non-trivial collec- 
tive behavior is observed to emerge in intermediate regimes, as discussed 
above). Examples of major evolutionary transitions where similar dynam- 
ics are expected to have played out include the origin of life, the origin 
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of eukaryotes, the origin of multicellularity, and the origin of eusociality. 
For a more detailed discussion, see Walker, Cisneros, and Davies |113) . 
Importantly, the origin of life may be associated with the most important 
and fundamental of these transitions, being the first level of organization 
where such a reversal of information flow might be observed. 

6.3 Quantifying the transition from Non-Life to 
Life 

Given that when life as we know it first emerged, it was surely character- 
ized by the same distinctive hierarchical and causal structure as all known 
life, the critical step in the origin of life may be associated with a physical 
transition (perhaps akin to a thermodynamic phase transition as noted 
above), and this physical transition is in turn associated with a funda- 
mental change in causal structure (schematically illustrated in Fig. [2|. 
The origin of life thus is unified with, but distinct from, the "higher-level" 
evolutionary transitions noted above in that it marks the first appearance 
of this reversal in causal structure, thus making it a unique transition in 
the physical realm (marking the transition from trivial to nontrivial in- 
formation processing). The utility of this approach is that it provides a 
useful criterion for measuring the "emergence" of life, with a clear defini- 
tion of one should look for - the origin of life should be characterized by a 
transition from bottom-up to top-down causation, mediated by a reversal 
in the dominant direction of the flow of information from lower to higher 
levels of organization (bottom-up), to that from higher to lower levels of 
organization (top-down), as observed for the toy model described above. 

Therefore, to quantify the transition from nonlife to life, a measure of 
causal information transfer is necessary. Standard measures of informa- 
tion, such as Shannon or mutual information, rely on static probabilities. 
In other words, they can only be used to characterize passive (trivial) in- 
formation processing, but fail to capture the causal efficacy of context that 
distinguishes living systems. A measure of causal information transfer is 
therefore necessary to attempt to capture this aspect of context-dependent 
causation. In order to infer causal information transfer (i.e. from higher 
to lower levels versus from lower to higher levels of organization, or as in 
the example of the previous section, from the mean-field to local elements 
versus from local elements to the mean- field) , a measure that can capture 
dynamical structure by means of transition probabilities rather than static 
probabilities is required. 

In Figure |3j a measure of causal information transfer was implemented 
which is able to effectively distinguish driving and responding elements 
and to detect asymmetry in interactions between systems |119| . The dy- 
namical character of the interactions is extracted by introducing a time lag 
in order to compute the relevant transition probabilities. As an example, 
one may consider a Markov process of order k. The traditional conditional 
probability p{xn+i\x„, . . . ,Xn-k+\) = p{x n+l I ; ■ ■ ■ 7 ^n — k-\-l t '•^n — k ) de- 
scribes a transition probability whereby each state a;„+i of the process 
is dependent (conditional) on the last fc-states but is independent of the 
state Xn-k and all previous states. This conditional relationship can be 
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extended to any fc-dimensional dynamical system as prescribed by Takens 
embedding theorem |120j . To simplify notation, an embedded state may 
be defined as = (a;„, . . . ,x„^k+i), which describes a state in the k- 
dimensional phase space, such that the series of vectors {xl^^^ } contains all 
of the information necessary to characterize the trajectory of the dynam- 
ical variable x. Using this definition, the dynamical information shared 
between two processes, x and y, may then be quantified by the transfer 
entropy |119| : 

Ty%x = 5Z ' ' y"*' ) log 

n 

This measure incorporates causal relationships by relating delayed (em- 
bedded) states, xi^^^ and to the state x„+i, and quantifies the incor- 
rectness of assuming independence between the two processes x and y. In 
short, the transfer entropy provides information about the deviation from 
the expected entropy of two completely independent processes. This par- 
ticular measure was implemented in toy model of Walker, Cisneros and 
Davies (see Figure [sj, using the time evolution of the global "mean-field" 
and one local element as the two dynamical time series to be correlated. 

We cite this measure of causal information transfer here as a specific 
example of the kind of information measure that is needed to infer causal 
relationships. However, any measure which captures the dynamical rela- 
tionships dictating the directionality of information flow would likewise 
provide a useful measure of transitions in causal structure. Additional 
measures which infer causal directionality |121| or measure the emergence 
of causal states [1221 112j] . i.e. delocalization information and as such of 
of causal influences, would likewise contribute to a deeper understanding 
of the transition from nonliving to living matter. An important feature of 
this formalism is that one need not characterize being alive or dead in a 
binary fashion, but might also observe classes of "almost life" . A relatively 
unappreciated challenge in research on the origin of life is that even if we 
adopt a particular working definition of life, we still need a definition of 
"almost life" to characterize systems which may embark on the pathway 
toward life but never reach the requisite complexity (if it indeed turns out 
that we can define a minimal complexity for life, perhaps by measuring a 
minimal complexity to support informational efficacy as described here). 
It makes sense to try to explain life's origin only if it resulted from pro- 
cesses of moderately highly probability, so that we can reasonably expect 
to explain it in terms of known science. Thus the origin of life field is 
based on the assumption that although we have only a vague idea what 
actually took place, whatever happened was not exceedingly improbable. 
In this paper we have subscribed to this currently popular but far-from- 
proven assumption (see Carter [124] for a discussion of the consistency 
of a single observed biosphere with post-selection of a set of exceedingly 
rare processes). Proceeding then on the assumption that the origin of life 
is not exceedingly improbable, it follows from simple statistics that there 
will have been a large ensemble of systems proceeding down the pathway 
toward life, and no obvious reason why only one member successfully com- 
pleted the journey. Ideally then, there should be a parameter, or more 



p{Xn+l\x'n\yn^) 



p{Xn + l\x^ 
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Property 


Necessary to Known Life 


Necessary to All Life 


Digital Information 
Storage 


Yes, required for heredity in extant 
life 


Maybe, likely necessary for 
longevity and instructional 
orthogonality but may not be 

essential to defining life 


Analog Information 
Processing 


Yes, biochemistry processes infor- 
mation through analog biochemi- 
cal reaction networks 


Maybe, but analog alone will ex- 
hibit limited evolutionary capacity 
(analog life may not have staying 
power) 


Explicitly Encoded 
Information 


Yes, genetic encoding is mediated 
by 'bilingual' molecules 


Maybe, requisite of abstraction 
and digitization of information, 
but it is at present unclear if this 
is requisite of the dclocalization of 
cause 


Dclocalization of 
Causal Influences / 
Context Dependent 
Causation 


Yes, characterized by separation of 
information from its physical rep- 
resentation 


Yes, may be a defining characteris- 
tic of all life 


Dynamical Laws 
Change with States 
(Nontrivial Informa- 
tion Processing) 


Yes, characterized by convolution 
of dynamical laws and states (state 
determines the dynamics and the 
dynamics determine the state) 


Yes, may be a defining characteris- 
tic of all life 



Table 1: The hallmarks of life. 



likely a set of parameters, to quantify progress toward life. Such a con- 
structive measure of how close chemical systems are to the living state has 
been notoriously absent in almost all discussions of the origin of life. With 
the formalism presented here, this gradation of states of "almost life" may 
be quantified by adopting a variant of the transfer entropy and applying 
it to the relevant chemical kinetics. The most challenging aspect of using 
this approach is in identifying which dynamical times series are relevant 
and useful for determining the causal structure of a particular system. Ap- 
plications of this approach to the relevant chemical kinetics might include 
analyzing time series or reaction rates or the concentrations of monomers, 
polymers, or even functional moieties. This may provide new avenues of 
research into the origin of life by directing efforts toward understanding 
how chemical systems come under direction of the global context rather 
than focusing strictly on the evolutionary processes that might enable a 
transition to the living state but do not necessitate it. 

7 Conclusions 

We have presented a framework for understanding the origin of life as a 
transition in causal structure whereby information (stored in the state of 
a system) gains causal efficacy over the matter it is instantiated in. The 
hallmarks of living systems based on this approach as discussed in this 
paper are summarized in Table [l] The advantage of this perspective is 
that it provides a foundation for identifying the origin of life as a well- 
defined transition. How this transition occurs remains an open question. 
While we have stressed that Darwinian evolution lacks a capacity to elu- 
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cidate the physical mechanisms underlying the transition from non-life 
to life or to distinguish nonliving from living, evolution must still drive 
this transition (even if it does not define it). It is likely that nontriv- 
ial information processing systems with delocalized information are more 
evolutionarily robust given that information can be preserved in the face 
of changing environmental conditions due the separation of instructions 
and their physical representation. Analog life-forms could have existed in 
the past (or could exist now as part of a shadow biosphere |85]) but are 
not likely to survive over geological timescales without explicitly digitized 
encoding of information. Therefore life-forms that "go digital" may be the 
only systems that survive in the long-run and are thus the only remain- 
ing artifact of the processes that led to life which survived to the present 
epoch. As such, the onset of Darwinian evolution in a chemical system 
was likely not the critical step in the emergence of life as is typified by 
analog and digital perspectives which focus solely on the storage and prop- 
agation of information. As we have seen, trivially self-replicating systems 
can accomplish this. Instead, the emergence of life was likely marked by 
a transition in information processing capabilities. This transition should 
be marked by a reversal in the causal flow of information from bottom-up 
to top-down. This criteria is more robust than the Darwinian one, in 
that it provides a measure for determining precisely when the origin of 
life occurs in a given system, rather then merely identifying systems with 
the capacity to potentially evolve to a living state. It therefore provides 
new avenues for understanding how such a transition might occur. 

A distinguishing feature of the shift in causal structure associated with 
the origin of life - which sets it apart from more traditional phase transi- 
tions - is that one must take into account causal relations to infer which 
state [e.g. living or non-living) a given system is in. Therefore, the tran- 
sitory dynamics and state (alive or not) of a particular system can only be 
determined by analyzing a time series of data characterizing the systems 
dynamical evolution. You cannot look at the system at any one instant 
in time and identify as alive or not - life is a dynamical and emergent 
systemic property, thus the oft-repeated point that "no single atom in 
a cell is alive, but the cell as a whole is living" . Thus, for example, a 
desiccated organism might have all the requisite hardware of a living or- 
ganism, but one cannot generically identify it as such (i.e. with any of 
the typical identiflers such as metabolic activity or reproduction) unless 
it is actively processing information under hydration (in this case a nec- 
essary facet of the context). Characterizing the origin of life as a shift 
in causal structure marks it as a unique transition in the physical realm 
- distinguishing nonliving dynamical systems, which display trivial infor- 
mation processing only, from living systems (and the complex systems 
derivative of biological systems) which display nontrivial information pro- 
cessing as two logically and organizationally distinct kinds of dynamical 
systems (note this does not preclude that their may be a gradation of 
states which are "almost" life mentioned above). Our usual causal narra- 
tive, consisting of bottom-up action only, may therefore be only a subset 
of a broader class of phenomena - including life - which are characterized 
by their causal structure. 
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